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I^ETHOD AND APFARATU5 FOR EFFECTIVE TRAFFIC LOCAUZATION 
THROUGH DOMAIN NAME SYSTEM 
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FIELD OF THE iMVEMTlOJ sr 



This invention relates to the field of managing communication on the 



dialosure 



Portions of the disclosure of this patent document contain material 
is subject to copyright protection. The copyright owner has no objection 
facsimile reproduction by anyone of the patent document or the patent 
as it appears in ttu? Patent and Trademark Office file or records, but 
reserves all copyright rights whatsoever. Sim, Sun Microsystems, 
Sun logo, Java, JavaBeans, Hotjava and all Java-based trademarks and 
;(>s are trademarks or registered trademarks of Sun Microsystems, Inc. in 
United States and other countries. 



BACKC^ROTJMD ART 



Computer systems sometimes rely on a server computer system to 
proiride information to requesting computers on a network. When there are 
a la rge number of requesting computers, it may be necessary to have more 
thai one server computer system to handle the requests. In prior art systems, 
25 theije is a problem in efficiently directing requests to the correct server in a 
tiple server system. 
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One area where this ha$ been a problem on the internet. The 
pnj^blem can be better understood by reviewing the structure and operation of 



PCTAJS99/t0942 



th€ internet below. 



The I;fit^tngt 



The Internet is a worldwide network of interconnected computers. An 
Internet client acces&e$ a computer on the network via an Internet provider. 

Internet provider is an organization that provides a client (e.g., an 
individual or other organization) with access to the Internet (via analog 
telephone line or httegrated Services Digital Network Une, for example). A 
10 cliejnt can, for example, read information from^ download a file from or send 
electronic mail message to another computer/client using the Internet. 



an 



To retrieve a file or service on the Internet, a client must search for the 
file or service, make a coimection to the computer on which the file or 
sen^ice is stored^ and dowrJoad the file or service. Each of these steps may 
15 inv jlve a separate application and access to multiple, dissimilar computer 
syslems. The World Wide Web (WWW) was developed to provide a 
simpler, more uniform means for accessing information on the Internet. 

The components of the WWW include browser software, network 
linlis, servers, and WWW protocols. The browser software, or browser, is a 
20 user-friendly interface (i,e., front-end) that simplifies access to the Internet. A 
broWser allows a client to communicate a request without having to learn a 
coniplicated command syntax, for example. A browser typically provides a 
gra; )hical user iaterface (GUI) for displaying information and receiving input. 
Exaonples of browsers currently availabJe include Mosaic, Netscape Navigator 
25 and! Communicator, Microsoft Internet Explorer, and Cello. 
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Information servers mamtain the information on the WWW and are 
callable of processing a client request. Hypertext Transport Protocol (HTTP) is 
standard protocol for commtmication with an information server on the 
HTTP has communication methods that allow clients to request 
from a server and send information to the server. 



the 



WWW. 



rei 
ini 



the 



server 
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dafb 

To submit a request, the client contacts the HTTP server and transmits 
the request to the HTTP server. The request contains the communication 
method requested for the transaction (e.g., GET an object from the server or 
P03T data to an object on the server). The HTTP server responds to the client 
10 by ;ending a status of the request and the requested information. The 
conjnection ia then tenninated between the client and the HTTP server. 

A client request therefore^ consists of establishing a connection 
betveen the client and the HTTP server^ performing the request, and 
tenrdnating the connection. The HTTP server does not retain any 
information about the request after the connection has been terminated. 
HrjrP is, therefore, a stateless protocol. That is, a client can make several 
qiiests of an HTTP server^ but each individual request is treated 
d jpendent of any other request. The server has no recollection of any 
previous request. 



HTTP 



Resource 



An addressing scheme is employed to identify Internet resources (e.g., 
server, file or program). This addressing scheme is called Uniform 
Locator (URL). A URL contains the protocol to use when accessing 
server (e.g., HTTP), the Internet domain name of the site on which the 
is running, the port number of the server, and the location of the 
res(>urce in the file structure of the server. 
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The WWW us(?s a concept known as hypertext. Hypertext provides 
thi ability to create links within a document to move directly to other 
mJ ormation. To activate the link, it is only necessary to click on the hypertext 
link (e,g,, a word or phrase). The hypertext link can be to information stored 
on a different site than the one that supplied the current information. A 
UllL is associated with the link to identify the location of the additional 
inJ'ormation. When the link is activated, the client's browser uses the link to 
ac:ess the data at the site specified in the URL. 

If the client request is for a file, the HTTP server locates the file and 
10 setids it to the client. An HTTP server also has the ability to delegate work to 
gateway programs. The Common Gateway Interface (CGI) specification 
defines a mechanism by which HTTP servers communicate with gateway 
programs. A gateway program is referenced using a URL, The HTTP server 
activates the program specified in the URL and uses CGI mechanisms to pass 
15 program data sent by the client to the gateway program. Data is passed from 
ths server to the gateway program via command-line arguments^ standard 
inout, or envirorunent variables. The gateway program processes the data 
ard returns its response to the server using CGI (via standard input for 
example). The server forwards the data to the client using the HTTP. 

20 A browser displays irvformation to a clicnt/ii$er as pages or documents 

(r<jferred to as "web pages'' or ''web site5")- A language i$ nsed to define the 
format for a page to be displayed in the WWW. The language is called 
Hypertext Markup Language (HTML). A WWW page is transmitted to a 
cL ent as an HTML document. The browser executing at the client parses the 

25 document and displays a page based on the information in the HTML 
d( >cument. 
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HIML is a structural language that is comprised of HTML elements 
thlaf are ne&ted within each other. An HTML document is a text file in which 
certain strings of characters, called tags, mark regions of the document and 
assign special meaning to them. These regions are called HTML elements. 
5 E£ ch element has a name, or tag. An element can have attributes that specify 
pioperties of the element Blocks or components include unordered list text 
bc xes, check boxes^ radio buttons, for example. Each block has properties such 
as name, type^ and value. The following provides an example of the 



10 



15 



25 



stiTicture of an HTML document 



re 



<HTML> 

<;HEAD> 

element(s) valid in the document head 
</HEAD> 
<BODY> 

eiement(s) valid in the document body 
</UODY> 
</HTML> 



Each HTML element is delimited by the pair of characters "<" and ">" 
The name of the HTML element is contained within the delimiting 
20 cbaracters. The combination of the name and delimiting characters is 

"erred to as a marker, or tag* Each element is identified by its marker. In 



most cases, each element has a start and ending marker. The ending marker 
is identified by the inclusion of an another character, that follows the "<" 



cbaracter. 



HTML is a hierarchical language. With the exception of the HTML 
eliment, all other elements are contained within another element. The 
HfML element encompasses the entire document. It identifies the enclosed 
te:ct as an HTML document. The HEAD element is contained within the 
HTML element and includes information about the HTML document. The 
30 BODY element is contained within the HTML. The BODY element contains 
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all] of the text and other information to be displayed. Other HTML elements 
ar^ described in HTML reference manuals. 

Domain Name Server 

A computer user navigates the internet or web from a browser on a 
5 coinpnter system. To access a web site^ the user enters the host name of the 
b site into the browser. This can be accomplished by clicking on a link^ by 
activating a tool bar button, or by manually entering a name or address into a 
lodation field and pressing "enter". The names that a browser client uses are 
knpwn as host names, such as www,sun,com for example. The name that ib 
10 enured is not the actual Internet Protocol (IP) address of the intended web 
seijver. The actual IP address is a string of numbers that uniquely locate the 
web server that provides the web site data. A worldwide distributed database 
system, called the "Domain Name System (DNS)" provides the mapping 
betjween server names and the associated IP addresses. 

15 Client application software, such as a web browser^ use a local library, 

called the "DNS resolver" to obtain the translation from server name to IP 
address* The resolver in turn contacts a predetermined local DNS server to 
obtain the translation. DNS servers can maintain caches of previously 
res:>lved names. More specifically, name resolution processes typically 
20 require two hosts on the client side. Consider a user working on 

asha.eng.sun.com" that wants to get then address of ''whitehouse.com". The 
client browser will talk with a local resolver (a library attached to the browser 
pre cess itself, in the current example running on asha.eng.stm.com). The 
)oc d resolver will go to one of a relatively small number of local name 
25 ser/ers, e.g, "n5.$un.com"* Here ns.sunxom is called the client side name 
ser/er. The client side name server will commxmicate with the outside 
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world to determine the IP address of whitehouse.com, and forward this 
in'oimarton to the resolver that is part of the browser process. 

DNS is a global network of servers that translate host names into 
nilmerical addressee (known as Internet Protocol^ or TP addresses) and 
pr3vides IP address to name mapping as well. A DNS server consi&ts of a 
name server and a resolver. The name server provides responses to resolver 
r© quests when it can by supplying the correct address for the host name 
supplied by the resolver. When the DNS server is unable to provide the 
co rrect address, it invokes its resolver to provide a solution. The resolver 
10 passes the unknown host name to another name server on the internet 
network and waits for a reply. If the next name server can provide the 
address for the host name, it does. Otherwise, the host name is forwarded to 
another name server, and this repeats until a name server is foimd that can 
translate the host name, or until it is determined that no translation exists, 
15 The server with the correct trarxslation (if any) then forwards it to the server 
thiit requested it, and that server returns it to the server that requested it from 
th«; second server, and so on until it is returned to the original name server, 
who can then return it to the client's resolver, who returns it to the brow$er. 
The originally contacted name server then stores the translation in a local 
20 cache so that the next time it sees the host name, it can supply a translation 
without asking another name server. 

Once the IP address is known, the browser communicates with the web 
server at that address to retrieve the requested web page or other information. 

The operation of the DNS network is described in; 

25 P.V. Mockapetris "Domain names - concepts and facilities*'^ RPC 1034. Nov 
19 U. 
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/ . Mockapetris "Domain names - implementation and specification", RFC 
lOpS, Nov 1^87, 

D NS Serv er Problems 

An internet server is typically limited IN the ntJtnber of clients it can 
5 efficiently service at any one time. Hov^ever, the owners of an internet site 
do not want users to be denied access to tiieir internet site. If the site is 
po pular, internet site owners desire all attempted accesses to be successful. 

To provide such service^ some companies have implemented systems 
thit allow multiple internet servers to service requests for a single internet 
10 siti?. If there are two servers^ it would be expected that each server would 

seivice approximately half of the client requests to the supported internet site. 
This concept is known as "distributed servers", in our case^ "distributed 
in^met servers". 

There are a number of schemes for implementing the distributed 
15 inljernet server. The schemes involve the manner in which requests to the 
site address are routed to the multiple server. One such scheme is 
a DNS shuffle address or "round-robin'\ In Hiis scheme, as each 
comes to the internet site address, the servers that respond are rotated 
L jsome order. If there are three servers in the distributed system, then any 
20 onje of the servers handles every third request. This scheme has a 
disadvantage of ignoring load balancing considerations and traffic 
locializdtion considerations. 

Another scheme uses a freely available script called 'Ibnamed" that 
prbvides a DNS server with the ability to retLim a different EP address for 
25 evfery client request received for a internet site host address. The returned IP 
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address 



can be made to depend on server load as well as availability of local 
inljemet servers, but ignores the relative distance between clients and the 
available servers. 



scheme 



Thi 



A product by Cisco Systems known as "DistributedDircctor" is a 
that relies on internet routing tables to provide locality information, 
i Cisco scheme imposes extra latency during servicing of DNS requests. It 
suppresses DNS caching mechanisms^ adding traffic to the internet. It 
reacts to each change in the internet routing, which changes very 
frequently. 



alS3 
ak:> 



15 {vi^ 
can 



LoJid 



20 the 



IBM has developed an "Interactive Network Dispatcher (IND) load 
balkncing product. It consists of Interactive Session Support (ISS) and a 
Network Dispatcher (ND), For TCP/IP client requests, IND chooses a server 
cluster (via ISS) and then directs the client request to the appropriate server 
ND), ND routes the request to the chosen server transparently. iTie ISS 
generate load information on servers, can perform ping triangulation 
initiated at servers to determine the ''nearest" server (cluster) to a client and 
influence client routing of requests by supplying the necessary DNS replies, 
information is collected through load monitoring agents (advisors) near 
servers. Multiple metrics are supported {e,g,, CPU, DASD, I/O). ISS 

its own DNS server implementation to generate the necessary DNS 
This scheme has the disadvantage of requiring modification of the 
exifeting DNS system. There are aiso disadvantages associated with ping 
triJ ngulation. These include increased network traffic, timing penalties in 
pesforming the ping operation, and the fact that it needs to be updated 
coittinuously to maintain accuracy. 



prc^vides 
rerlies. 
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Another approach i$ known as "smart clients" from U. C. Berkeley 
Ktsearch, The smart client approach is an architecture for web traffic client- 
server communication that allows for a dynamic server choice based on load 
ai d availability. The approach allows for the use of multiple server 

lines to achieve scalable performance, for load balancing, for fault 
tTitnsparency, and backwards compatibility with the existing addressing 

die (URLs)- The architecture reqxiires client web browsers to execute 
d<jwnloadable, service specific code. This code is divided into a GUI thread 

director thread. Server choice, load balancing, and fault transparency ate 
enjcapsulated by the director thread. A disadvantage of this scl^eme is that it 
ires cooperation of requesting clients. It imposes extensive overhead on 
sii^gle web page retrieval and sxirfing to new sites. 



fo 



10 
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Prior art scheme such as the ones described above are described in the 
lowing: 



"A Novel Server Selection Technique for Improving the Response 
Ti^ne of a Replicated Service" Zongming Fei^ Samrat Bhattacharjee, Ellen W. 
Zegura, Mostafa H. Ammar. Networking and Telecommunications Group, 
CcjUege of Computing, Georgia Institute of Technology. 

"Chapter 1. Introducing IBM's Interactive Network Dispatcher'' IBM 
20 U^rs Guide, (http://www.software,ibm,com/enetwork/dispatcher/) 

"Cisco DistributedDirector ♦ More Lmformatjon on DistributedDirector" 
(h^:/ywww,dsco.com/warp/public/751/distdir/dd_wp.htm) 

"Using Smart Clients to Build Scalable Services'' Chad Yoshikawa, 
Br^nt Chun, Paul Eastham, Amin Vahdat, Thomas Anderson, and David 
25 Cxiller. Computer Science Division, University of California, Berkeley 
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9UMMARY OP TFTR TNVENtt on 

The present invention uses a client-side computation to efficiently 
prjDvide translation of a domain name to the address of a "good" (i.e. close, 
I, nearby) server of a distributed server system. The invention uses 
application client's resolver to perform some computation to determine 
IP address nf a preferred server for that client. When the client provides 
web server name {say Virww.sun.com) to the DNS resolver, the DNS 

retm-ns data, or a smaU applet that runs at the browser's local resolver 
generate the desired IP address. The present invention is processed by the 
server (in the resolver portion) and at the client. The web server does 
need to participate. The invention does not require any changes to the 
DNS infrastructure. The invention al$o can direct cUents to more 
servers and avoid expensive "long-haul" links. The invention also 

the critical caching property of the current DNS system, has reduced 
iatincy than other schemes, less traffic for the network and DNS servers, and 
suijjports the use of secondary DNS servers. 
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BRIEF DESCRTPTTONT OF THE DRAWTNCfi 

Figure 3 is a block diagram of an example computer system that can be 
u^d with the present invention. 

Figure 2 is a diagram illustrating an embodiment of the present 
ir^venrion. 

Figure 3A is a diagram illustrating the operation of an embodiment of 
tHe present invention. 

Figure 3B is a flow diagram illustrating the operation of an 
eihbodiment of the present invention. 
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DETAILED DHSCRXPnON OF THE INVT^NTro\r 



The invention is a method and apparatus for providing effective traffic 
K realization through the Domain Name System, In the following 
d^cription, numerous sp>edfic details are set forth to provide a more 
tliorough description of embodiments of the invention. It will be apparent, 
h ?wever^ to one skilled in the art, that the invention may be practiced 
ithout these specific details. In other instances, well known features have 



n :)t been described in detail so as not to obscure the invention, 

10 The invention provides a system for providing IP addresses of servers 

ir I a distributed server system in a manner that provides efficient traffic 
k calization. The invention takes advantage of the existing DNS 
ir frastructure and uses some client processing for server selection. When a 
browser client provides an address to a DNS server, the server responds, 
15 w hen appropriate, with data^ a table of data, or a thin client applet to the 
client browser. The client's local resolver uses the data, table or applet to 
g^ine^ate the IF address of a server that is a "best" server for the client The 
s< lected server may not be the optimal server for the client, but it may be one 
tiat promotes efficient use of network resources. The invention has equal 
20 a])pljcation when the DNS server provides either data, a table of data, or an 
a])plet. For purposes of discussion, the information will generally be referred 
tc here as "table/applet". Where appropriate, specific discussion of one of the 
possible forms of the information will be made. While the ensuing 
scussion uses the web service as a canonical example of an Internet service, 
25 the invention described herein applies equally well to all other internet 

S^rviceSy including, but not hmited to^ the telnet service and the ftp service. 
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The invention is implemented via agents, servers^ and clients. The 
a^nts collect the network topology/ load information at a particular web 
server farm, and provide it to the DNS server for this service. (Here, a web 
server farm refers to a set of nearby servers that work together). The servers 
obtain the load /topology information from many agents, and use the 
inrormation to generate a single applet/table. The clients employ the 
af plet/table to choose from the available IP addresses for the given server 
ii4me. 

A diagram of the present invention is illustrated in Figure 2, In the 
10 e>jample of Figure 2, a distributed server system serves the web site 

Mrww.sun.com''. Referring to Figure 2, two web servers 201A and 201B both 
selrver the web site www.sun,com. These web servers feed load/ topology 
intbrmation about themselves to new tables/applets database 202. This 
database is coupled to the Sim primary DNS server 203. The Sun PiiniLary 
15 DWS server 205 includes a revolver library 206 and is coupled to a DNS 
database 203 and cache 204. The Sun primary DNS server 205 handles 
req[uests from other resolvers such as request 207. This request can come 
from a client browser resolver, or from other DNS servers. The DNS server 
205 handles the request and sends reply 208. 

20 When DNS server 205 receives a request to resolve the host name 

siinxom into an EP address, it provides a table/ applet as a reply 208 to the 
requesting resolver. 

The distance/metric tables are created by agents collecting network 
tobology and load information. This is done at a particular web server farm 
25 aijd the information is provided to the DNS server that serves the web site. 
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The agent is a process that is used by any distributed server system that desires 
to take advantage of the present invention. 

The agents could use any of a number of processes to obtain the 
nebessary information. For example, IBM uses ping triangulation to obtain 
5 approximate delay information. One description of collecting information 
fr4m ping requests and incorporated herein by reference is described at 
ht^:// www.internetweather.com based on 45 packet (one packet /second) 
pi^g tests with 210 byte packets between a central site and the primary domain 
server for a number of networks. Cisco uses BGP information from its own 
10 roiiters to get this information. Another way Is to get the picture from 
routing tables at major exchange points (for example, the web site 
wTKrwjnerit.edu provides routing table snapshots}* Links to informatiun 
regarding these methods may be found at the Cooperative Association for 
In^mel Data Analysis (CAIDA) at http://www.caida.org and incorporated 
15 hejrein by reference. 

The information does not have to be precise. It may be possible to 
de Einc ranges of IP addresses that define groups of servers. Consider where 
seivers are defined in groups A, B, and C. Information tliat shows that server 
clt ster hosts in group A are served by same ISP, hosts in group B have to go 
20 thr ough one major exchange, and those in group C have to go through 

ex ?ensivey under-provisioned international routes are usable in the pre$ent 
invention* 

For a load value, the machine load average may be used, 
Al lematively^ it may be suitable to measure the weighted tail average of 
25 connection set up rate, or the disk I/O rate, and normalize this by the 

midline capacity. Another option would be to periodically connect to the 
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Server process from a nearby client (or same machine)^ and use the time-to- 
ccjnnect as an estimate of the machine load. This can also be used to calibrate 
thfe server when using other metrics (connection set-up average, disk I/O etc.) 

Servers (Collating Topology /T ;^acj 



th^t 
ta 



10 



The servers are responsible for collating the topology/ load information 
the agents provide and to use this information to generate the 
5le/applet By way of example^ let us assume that there are two servers in a 
distributed server system, with IP addresses 11.11.11.11 and 22,22.22.22 

ctively. The agents could provide a table for each server as shown in 
1 and 2 below. 
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Table 1 (Server 11.11.11.11) 





Client 


Distance metric 










net x.y.0.0 


4 




net x.z.0.0 


3 




net a.b.cO 


6 




Table 2 (Server 22.22.22.22) 




Client 


EHstance metric 










net x.y.0.0 


2 




net x,z.O,0 


5 




net a.b.c.O 


3 



The server could merge together this information by going through ciU 
ihfe client tables in parallel and working through the different address blocks. 
Fc r each block^ the server could choose the best server of the dbtribxited 
servers through a combination of locality and load information. This 
in Formation can be placed in a generated table which is returned in a new 
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D^fS record type. One example of such a merg^rd table is illustrated below in 
Taile 3, 

Table 3 





Client 


Server 










net 


Srv 2? 72 7777 




net 


Srv 11.11.11.11 




net a.bx.O 


Srv 22.22.22.22 



One option would be to restrict the generated table to provide only one 
chi>ice to each client, depending on the client IP address. For example, if the 
cli^t IP address is within a certain range, one server in the server farm 
w<iuld be identified as the appropriate server for the web site. Another 
opfcion is to allow clients a small number of choices. The client can choose 
10 nearness or locality in "hard" cases. Letting a client choose among several 
choices may provide a degree of randomization that leads to improved 
balancing among several nearby servers. An alternate and more dynamic 
opfcion would have the applet periodically query a "load-server" to refresh the 
ta1|le information. (Note that in these cases, the server should manage the 
15 TTiL information that it returns to the resolver)» 

It is useful for servers to restrict the size of the tables that tltey return to 
thi clients. For backward compatibility with existing resolvers, the server 
should also put one (or more) IP addresses in the DNS address record 
relumed. 

20 Cl §lI>tS, (Servpr ChOipg) 



The client-side code resides in the resolver. It is responsible for 
seixding the DNS lookup request to the DNS server, accepting the returned 
applet/table from the DNS server, to nm the applet/table code to choose the 
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ni address that the server name maps to, and to return this IP address to the 
rdquesting application. This component should be the simplest part of this 
system - all it depends on is the language used for shipping the condensed 
server load/locality information in the applet/table. If the applet is a Java'^" 

pplet this component just executes the Java''''^ programming language 
b] decode. If the returned information is a simple table, this component looks 
u]5 the table to find the entry that corresponds to the client IP address. 

In an embodiment of the invention, the information distributed by the 
D|NS is limited to the full set of server IF addresses. Clients make the choice 
10 oi which server to contact after the DNS resolution step^ before the actual 
ti quest is made. There arc at several options to how clients con make this 
cljioice, including, but not limited to, client initiated ping triangulation, dient 
ir itiated timing service request, random choice, cost-based choice, or no 
djioice (where the system provides only one address, or the client always 
15 dliooses just one). 

The client initiated ping tri2ingulation may suffer from measurement 
ptoblems. 

The client initiated timing approach is to send a minimal service 
request to all servers listed in the table and measure the response time. For 
20 e: cample, in absence of a standard timing service request, a Web client could 
request the retrieval of a file that would not be served (e.g.Vdev/null) and 
measure the arrival of the corresponding HTTP error message. This 
[ ^proach suffers from some of the ping triangulation problems, but has no 
firewall filtering problems, for example. This approach gains in 
25 a :tractiveness if the availability of HTTP over hybrid TCP-UDP is assumed. 
Furthermore, in the scenario where the IP addresses represent server clusters. 
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th4 fake request would be intercepted by the network dispatcher and its reply 
coiiild be artificially delayed based on the load characteristics present at the 
neiwork dispatcher. 

Figure 3A illustrates the relationship between the client browser and 
5 th4 DNS server in an embodiment of the present invention. The client 
browser 390 has an assodated DNS resolver 391. When a host name is 

in the browser 390, the DNS resolver 391 initiates a request to a DNS 
392 for translation. The DNS server 392 replies to the DNS revolver 
39J with an IP address or with a table/applet 
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10 The operation of the components of Figure 3A is illustrated in the flow 

die gram of Figure 3B. At step 301 a browser user or application enters a host 
into the browser. At step 302, the browser requests a translation of the 
name from its DNS resolver. At decision block 303 the argument 
"iSjame Cached?" is made. This is to determine if the DNS resolver already 
a translation for the requested name. If the ar^ment at decision block 
is false, meaning the name is not cached, the DNS resolver contacts the 
server at step 304 and initiates a request. At step 305 the DNS resolver 
receives an answer (IF address), a table, an applet^ or an error message from 
th^ DNS server* Depending on the host name being translated, and the 
20 embodiment of the invention, the DNS server could provide any of several 
possible responses. If the host name is for a site that does not use a 
difiltributed web server system, the DNS server retturns an answer that is an IP 
adjlress. Even when the host name is for a site that uses a distributed web 
seijver system, the DNS server could retum an IP address in one embodiment 
25 of ^e invention, if it is desired to have processing and choices made at the 
D^^S server. In another embodiment of the invention, the DNS server 
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retims a table or an applet for a host name that b for a web site using a 
distributed web server systeEm. After step 305, the system proceeds to step 306. 

If the argiiment at decision block 303 is true, meaning that the DNS 
resj3lver of the browser does have the name cached, the system proceeds to 
5 decision block 306. Note that if the name is cached, it couid be translated to 
eidker an IP addre$$ (answer) or a table/applet At decision block 306 the 
arglument "Answer or Table/Applet?" is made. If the DNS response (or 
cached translation) is an answer, the DNS resoJver provides the einswer (IP 
ad^^ress) to the browser at step 307. 



blo:ic 



or a 



the 



20 312 



20 



10 If the DNS response is a table/applet the system proceeds to decision 

308. At decision block 308 it is determined whether an applet executable 
table has been returned. If it is a table, the client finds its own client IP 
in the provided table at step 309 (this may involve finding a specific 
for its EP address, or a range of IP addresses that contain its own IP 
15 adc^ress)- At step 3X0 the client links to the web server IP address returned 
basfed on its own client IP address. 



adcLress 



entry 



If at decision block 308 the client determines it has received an applet, 
client executes tiie applet bytecode at step 311 to retrieve a server IP 
. The client then links to the retrieved web server IP address at step 



adc ress 



Whether using a table or an applet the client can identify an 
appropriate server baaed on a number of factors. For example, the client may 
be ] presented with several servers to choose from, with better service available 
at tigher cost (at a fee based web site for example). The client can 
25 predetermine criteria for selecting an appropriate server. A dient may define 
connection speed as the most important factor in choosing a server. 
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irrespective of cost. Alternately, a client could define a minimum connection 
that might be met by servers of different costs at certain times of day. In 
oth^r cases, the client could preclude servers that have any extra cost 

and therefore only choose from nt>-cost servers. Clients can also 
a server based on geographical proximity to the client, or on network 
prokunity to the diem. 



ass<>ciated 



The steps of finding the web server IP address may be accomplished by 
a siinple look up, or by choosing from several possible IP addresses baaed on 
the lookup, or after a locality determination as outlined above. 

10 Embodiment of Computer ExecuHnn Environment (Hardware^ 



An embodiment of the invention can be implemented as computer 
soft A^arc in the form of computer readable code executed on a general 
pur 30se computer such as computer 100 illustrated in Figure 1, or in the form 
15 of bytecode class files executable within a Java™ runtime environment 

ruruiing on such a computer. A kejtoard 110 and mouse 111 are coupled to a 
bi-directional system bus 118. The keyboard and mouse are for introducing 
usei input to the computer system and communicating that user input to 
pro<essor 113. Other suitable input devices may be used in addition to, or in 
20 place of, the mouse 111 and keyboard 110, I/O (input/output) tmit 119 
coupled to bi-directional system bus 118 represents such I/O elements as a 
printer, A/V (audio/ video) I/O, etc. 

Computer 100 includes a video memory 114, main memory 115 and 
25 masi storage 112^ all coupled to bi-directional system bus 118 along with 
keyloard 110, mouse 111 and prcK!essor 113, The mass storage 112 may 
include both fixed and removable media, such as magnetic, optical or 
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magnetic optical storage systennLS or any other available mass storage 
tecxnoiogy. Bus 118 may contain, for example, thirty-two address lines for 
addressing video memory 114 or main memory 115. The system bus 118 also 
udes, for example, a 32-bit data bus for transferring data between and 
5 among the components, such as processor 113^ main memory 115, video 
menory 114 and mass storage 112. Alternatively, multiplex data/address 
line s may be used instead of separate data and address lines. 

In one embodiment of the invention, the processor 113 is a 
10 mii^rnprooessor manufactured by Motorola, such as the 680X0 processor or a 
' manufactured by Intel, such as the 80X86, or Pentium 
r, or a SPARC7** microprocessor from S\m Microsj^tems'^", Inc. 
r, any other suitable microprocessor or microcomputer may be 
i. Main memory 115 is comprised of dynamic random access memory 
15 (DI^M). Video memory 114 is a dual-ported video random access memory. 
; port of the video memory 114 is coupled to video amplifier 116. The 
vid^ amplifier 116 is used to drive the cathode ray tube (CRT) raster monitor 
Alternatively, the video memory could be used to drive a flat panel or 
L crystal display (LCD), or any other suitable data presentation device. 
20 Vidh? amplifier 116 is well known in the art and may be implemented by any 
I apparatus. This circuitry converts pixel data stored in video memory 
i to a raster signal siaitable for use by monitor 117. Monitor 117 is a type of 
mo^kitor suitable for displaying graphic images. 



microprocessor : 
processor. 
Hoi vever. 



utilized. 



117 



]iqv id 



suitable ^ 



114 



25 Computer 100 may abo include a communication interface 120 

coupled to bus 118, Communication interface 120 provides a two-way data 
corttmunication coupling via a network link 121 to a local network 122. For 
xa:|nple, if communication interface 120 is an integrated services digital 
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network (ISDN) card or a modem, communication interface 120- provides a 
daia communication connection to the corresponding type of telephone line^ 
which comprises part of network link 121. If communication interface 120 is 
a hkcal area network (LAN) card, communication interface 120 provides a data 
5 communication connection via network link 121 to a compatible LAN. 

Wi -eless links, modems, or cable modem links are also possible. In any such 
imj>lementation, communication interface 120 sends and receives electrical, 
ele(i;roraagnetic or optical signals which carry digital data streams 
representing various types of information. 

10 

Network link 121 typically provides data communication through one 
or more networks to other data devices. For example^ network link 121 may 
provide a connection through local network 122 to local server computer 123 
or io data equipment operated by an Internet Service Provider (ISP) 124. ISP 
in turn provides data communication services through the world wide 
packet data communication network now commonly referred to as the 
"InlBmet" 125. Local network 122 and Internet 125 both use electrical, 
electromagnetic or optical signals which carry digital data streams. The 
signals through the various networks and the signals on network link 121 

through communication interface 120^ which carry the digital data to and 
froii computer 100, are exemplary forms of carrier waves transporting the 
information. 

Computer 100 can send messages and receive data, including program 
25 cod^, through the network(s), network link 121, and communication 

120. In the Internet example, remote server computer 126 might 
a requested code for an application program through Internet 125, 
124, local network 122 and communication interface 120, 



20 anc 
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The received code may be executed by processor 113 as it is received, 
and/or stored in mass storage 112, or other non-volatile storage for later 
execution. In this manner, computer 100 may obtain application code in the 
forjn of a carrier wave. 

Application code may be embodied in any forin of computer program 
prcjduct, A computer program product comprises a mediixm configured to 
or transport computer readable code, or in which computer readable 
J may be embedded. Some examples of computer program products are 
disks, ROM cards, floppy disks^ magnetic tapes, computer hard 
drij^es, servers on a network, and carrier waves. 



stoie 



coce 
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The computer systems described above are for purpo&es of example 
only. An embodiment of the invention may be implemented in any type of 
con\puter system or programming or processing environment. 

Thus, a method and apparatus for providing effective traffic 
locAlization through the Domain Name System is described. 
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CLA IM S 

1. A method of implementing a distributed server system 
conjiprising the steps of; 

generating distance/ metric information for a web site associated with a 
server system; 

providing said information to a client when said web site is requested 
4aid client; 

using said information to identify an appropriate server of said 
server system to serve said web site to said client, 



dislributed ; 



by 



10 distributed 
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2. The method of claim 1 wherein said information comprises a 
pltt ality of ranges of IP addresses with at least one server IP address 
associated with each range. 

The method of claim 1 wherein said s^ep of identifying an 
appropriate server comprises identifying one of said plurality of ranges of IP 
containing said client's IP address and selecting said at least one 
IP address corresponding to said range. 



add resses ' 



server 



4. The method of claim 1 wherein said plurality of ranges of IP 
addtesses has a plurality of server IP addresses associated with each range. 

5» The method of claim 4 wherein said step of identifying an 
25 appjropriate server comprises identifying one of said plurality of ranges of IP 
addresses containing said client's IP address and selecting said one of said 
pluiiality of server EP address corresponding to said range. 
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6. The method of claim 5 wherein said step of selecting one of said 
plurality of server IP addresses is accomplished by randomly selecting one of 
saiji plurality of server IP addresses. 

7, The method of claim 5 wherein said step of selecting one of said 
plT^ality of server IP addresses is accomplished by identifying a closest one of 
saifl plurality of server IP addresses. 



10 tab 



9. The method of claim 1 wherein said information comprises an 
exebutable file* 



15 
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8. The method of claim 1 wherein said information comprises a 



e. 



10. The method of claim 9 wherein said executable file comprises an 



applet. 



11, The method of claim 1 wherein said step of generating 
dislance/metric information is accomplished by the use of agents. 

12. The method of claim 11 wherein said agents generate said 
disiance/metric information by performing ping trianguladons. 



13. The method of claim 11 wherein said agente generate said 
25 disijance/metric information by collecting BGP information, 

14. The method of claim 11 wherein said agents generate said 
disijance/metric information by the use of routing tables. 
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15, The method of claim 5 wherein said step of selecting one of said 
plurality of server IP addresses is accomplished by determining a cost 
asitociated with each of sadd plurality of server IP addresse-S and selecting one 
said plurality of server IP addresses based said cost. 



27 
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16. An computer program product having computer readable 
program code comprising: 

computer readable program code configured to cause a computer to 
10 geiierate distance/metric information for a web site associated with a 
distributed server system; 

computer readable program code configured to cause a computer to 
pr(j>vide said information to a client when said web site is requested by said 



cliimt; 



computet readable program code configured to cause a computer to use 
saii information to identify an appropriate server of said distributed server 
sys tern to serve said web site to said client. 



17, The computer program product of claim 16 wherein said 

comprises a plurality of ranges of IP addresses with at least one 
serk^er IP address associated with each range. 



20 information i 



18. The method of claim 16 wherein identifying an appropriate 
server comprises identifying one of said plurality of ranges of IP addresses 
25 coiLtairung said client's IP address and selecting said at least one server IP 
address corresponding to said range. 
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19. The computer program product of claim 16 wherein said 
pliirality of ranges of IP addresses has a plurality of server IP addresses 
associated with each range. 



an 
IP 

pli 



20. The computer program product of claim 19 wherein identifying 
appropriate server comprises identifying one of said plurality of ranges of 
iddresses containing said client's IP address and selecting said one of said 
ality of server IP address corresponding to said range, 



20 
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10 21. The computer program product of claim 20 wherein seJecting 

on^ of said plurality of server IP addresses is accomplished by randomly 
selk:ting one of said plurality of server IP addresses. 

22, The computer program product of claim 20 wherein selecting 
15 oni of said plurality of server IP addresses is accomplished by identifying a 
dofcest one of said plurality of server IP addresses. 



23. The computer program product of claim 16 wherein said 
information comprises a table* 

24. The computer program product of claim 16 wherein said 
inf<|>rmation comprises an executable file. 



25- The computer program product of claim 24 wherein said 
25 exe|:nitable file comprises an applet 

26. The computer program product of claim 16 wherein generating 
dist ance /metric information is accomplished by the use of agents. 
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27. The computer program product of claim 26 wherein said agents 
geiierate said distance/metric information by performing ping triangulations. 

28. The computer program product of claim 26 wherein said agents 
geijierate said distance/metric information by collecting BGP information. 

29. The computer program product of claim 26 wherein said agents 
generate said distance /metric information by the use of routing tables. 

30. The computer program product of claim 20 wherein selecting 
on^ of said plurality of server IP addresses is accomplished by determining a 
cos^t associated with each of said plurality of server IP addresses and selecting 
on^ of said plurality of server IP addresses based said cost. 
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